5 research outputs found

    Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual Speech

    Full text link
    Recognizing emotions in spoken communication is crucial for advanced human-machine interaction. Current emotion detection methodologies often display biases when applied cross-corpus. To address this, our study amalgamates 16 diverse datasets, resulting in 375 hours of data across languages like English, Chinese, and Japanese. We propose a soft labeling system to capture gradational emotional intensities. Using the Whisper encoder and data augmentation methods inspired by contrastive learning, our method emphasizes the temporal dynamics of emotions. Our validation on four multilingual datasets demonstrates notable zero-shot generalization. We publish our open source model weights and initial promising results after fine-tuning on Hume-Prosody.Comment: Accepted as talk at NeurIPS ML for Audio worksho

    A unified ontology-based data integration approach for the internet of things

    Get PDF
    Data integration enables combining data from various data sources in a standard format. Internet of things (IoT) applications use ontology approaches to provide a machine-understandable conceptualization of a domain. We propose a unified ontology schema approach to solve all IoT integration problems at once. The data unification layer maps data from different formats to data patterns based on the unified ontology model. This paper proposes a middleware consisting of an ontology-based approach that collects data from different devices. IoT middleware requires an additional semantic layer for cloud-based IoT platforms to build a schema for data generated from diverse sources. We tested the proposed model on real data consisting of approximately 160,000 readings from various sources in different formats like CSV, JSON, raw data, and XML. The data were collected through the file transfer protocol (FTP) and generated 960,000 resource description framework (RDF) triples. We evaluated the proposed approach by running different queries on different machines on SPARQL protocol and RDF query language (SPARQL) endpoints to check query processing time, validation of integration, and performance of the unified ontology model. The average response time for query execution on generated RDF triples on the three servers were approximately 0.144 seconds, 0.070 seconds, 0.062 seconds, respectively

    Data Collection Protocols For Wireless Sensor Networks

    Get PDF
    Data collection in wireless sensor networks (WSNs) has a significant impact on the networkā€™s performance and lifetime. Recently, several data collection techniques that use mobile elements (MEs) have been recommended, especially techniques that focus on maximising data delivery. However, energy consumption and the time required for data collection are significant for many WSN applications, particularly real-time systems. In this paper, a review of data collection techniques is presented, providing a comparison between the maximum amount shortest path (MASP) and zone-based energy-aware (ZEAL) data collection protocols implemented in the NS-3 simulator. Finally, the study provides a suitable data collection strategy that satisfies the requirements of WSN applications in terms of data delivery, energy consumption, and the time required for data collection

    Improved White Blood Cells Classification based on Pre-trained Deep Learning Models

    Get PDF
    Leukocytes, or white blood cells (WBCs), are microscopic organisms that fight against infectious disease, bacteria, viruses, and others. The manual method to classify and count WBCs is tedious, time-consuming and may has inaccurate results, whereas the automated methods are costly. The objective of this work is to automatically identify and classify WBCs in a microscopic image into four types with higher accuracy. BCCD is the used dataset in this study, which is a scaled down blood cell detection dataset. BCCD is firstly pre-processed by passing through several processes such as segmentation and augmentation,then it is passed to the proposed model. Our model combines the privilege of deep models in automatically extracting features with the higher classification accuracy of traditional machine learning classifiers.The proposed model consists of two main layers; a shallow tuning pre-trained model and a traditional machine learning classifier on top of it. Here, ten different pretrained models with six different machine learning are used in this study. Moreover, the fully connected network (FCN) of pretrained models is used as a baseline classifier for comparison. The evaluation process shows that the hybrid between MobileNet-224 as feature extractor with logistic regression as classifier has a higher rank-1 accuracy with 97.03%. Besides, the proposed hybrid model outperformed the baseline FCN with 25.78% on average
    corecore